120 research outputs found

    Protein Bioinformatics Infrastructure for the Integration and Analysis of Multiple High-Throughput “omics” Data

    Get PDF
    High-throughput “omics” technologies bring new opportunities for biological and biomedical researchers to ask complex questions and gain new scientific insights. However, the voluminous, complex, and context-dependent data being maintained in heterogeneous and distributed environments plus the lack of well-defined data standard and standardized nomenclature imposes a major challenge which requires advanced computational methods and bioinformatics infrastructures for integration, mining, visualization, and comparative analysis to facilitate data-driven hypothesis generation and biological knowledge discovery. In this paper, we present the challenges in high-throughput “omics” data integration and analysis, introduce a protein-centric approach for systems integration of large and heterogeneous high-throughput “omics” data including microarray, mass spectrometry, protein sequence, protein structure, and protein interaction data, and use scientific case study to illustrate how one can use varied “omics” data from different laboratories to make useful connections that could lead to new biological knowledge

    UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches

    Get PDF
    Motivation: UniRef databases provide full-scale clustering of UniProtKB sequences and are utilized for a broad range of applications, particularly similarity-based functional annotation. Non-redundancy and intra-cluster homogeneity in UniRef were recently improved by adding a sequence length overlap threshold. Our hypothesis is that these improvements would enhance the speed and sensitivity of similarity searches and improve the consistency of annotation within clusters. Results: Intra-cluster molecular function consistency was examined by analysis of Gene Ontology terms. Results show that UniRef clusters bring together proteins of identical molecular function in more than 97% of the clusters, implying that clusters are useful for annotation and can also be used to detect annotation inconsistencies. To examine coverage in similarity results, BLASTP searches against UniRef50 followed by expansion of the hit lists with cluster members demonstrated advantages compared with searches against UniProtKB sequences; the searches are concise (∼7 times shorter hit list before expansion), faster (∼6 times) and more sensitive in detection of remote similarities (>96% recall at e-value <0.0001). Our results support the use of UniRef clusters as a comprehensive and scalable alternative to native sequence databases for similarity searches and reinforces its reliability for use in functional annotation. Availability and implementation: Web access and file download from UniProt website at http://www.uniprot.org/uniref and ftp://ftp.uniprot.org/pub/databases/uniprot/uniref. BLAST searches against UniRef are available at http://www.uniprot.org/blast/ Contact: [email protected]

    Treatable childhood neuronopathy caused by mutations in riboflavin transporter RFVT2.

    Get PDF
    Childhood onset motor neuron diseases or neuronopathies are a clinically heterogeneous group of disorders. A particularly severe subgroup first described in 1894, and subsequently called Brown-Vialetto-Van Laere syndrome, is characterized by progressive pontobulbar palsy, sensorineural hearing loss and respiratory insufficiency. There has been no treatment for this progressive neurodegenerative disorder, which leads to respiratory failure and usually death during childhood. We recently reported the identification of SLC52A2, encoding riboflavin transporter RFVT2, as a new causative gene for Brown-Vialetto-Van Laere syndrome. We used both exome and Sanger sequencing to identify SLC52A2 mutations in patients presenting with cranial neuropathies and sensorimotor neuropathy with or without respiratory insufficiency. We undertook clinical, neurophysiological and biochemical characterization of patients with mutations in SLC52A2, functionally analysed the most prevalent mutations and initiated a regimen of high-dose oral riboflavin. We identified 18 patients from 13 families with compound heterozygous or homozygous mutations in SLC52A2. Affected individuals share a core phenotype of rapidly progressive axonal sensorimotor neuropathy (manifesting with sensory ataxia, severe weakness of the upper limbs and axial muscles with distinctly preserved strength of the lower limbs), hearing loss, optic atrophy and respiratory insufficiency. We demonstrate that SLC52A2 mutations cause reduced riboflavin uptake and reduced riboflavin transporter protein expression, and we report the response to high-dose oral riboflavin therapy in patients with SLC52A2 mutations, including significant and sustained clinical and biochemical improvements in two patients and preliminary clinical response data in 13 patients with associated biochemical improvements in 10 patients. The clinical and biochemical responses of this SLC52A2-specific cohort suggest that riboflavin supplementation can ameliorate the progression of this neurodegenerative condition, particularly when initiated soon after the onset of symptoms

    An Integrative Genomic and Epigenomic Approach for the Study of Transcriptional Regulation

    Get PDF
    The molecular heterogeneity of acute leukemias and other tumors constitutes a major obstacle towards understanding disease pathogenesis and developing new targeted-therapies. Aberrant gene regulation is a hallmark of cancer and plays a central role in determining tumor phenotype. We predicted that integration of different genome-wide epigenetic regulatory marks along with gene expression levels would provide greater power in capturing biological differences between leukemia subtypes. Gene expression, cytosine methylation and histone H3 lysine 9 (H3K9) acetylation were measured using high-density oligonucleotide microarrays in primary human acute myeloid leukemia (AML) and acute lymphocytic leukemia (ALL) specimens. We found that DNA methylation and H3K9 acetylation distinguished these leukemias of distinct cell lineage, as expected, but that an integrative analysis combining the information from each platform revealed hundreds of additional differentially expressed genes that were missed by gene expression arrays alone. This integrated analysis also enhanced the detection and statistical significance of biological pathways dysregulated in AML and ALL. Integrative epigenomic studies are thus feasible using clinical samples and provide superior detection of aberrant transcriptional programming than single-platform microarray studies

    Systems Integration of Biodefense Omics Data for Analysis of Pathogen-Host Interactions and Identification of Potential Targets

    Get PDF
    The NIAID (National Institute for Allergy and Infectious Diseases) Biodefense Proteomics program aims to identify targets for potential vaccines, therapeutics, and diagnostics for agents of concern in bioterrorism, including bacterial, parasitic, and viral pathogens. The program includes seven Proteomics Research Centers, generating diverse types of pathogen-host data, including mass spectrometry, microarray transcriptional profiles, protein interactions, protein structures and biological reagents. The Biodefense Resource Center (www.proteomicsresource.org) has developed a bioinformatics framework, employing a protein-centric approach to integrate and support mining and analysis of the large and heterogeneous data. Underlying this approach is a data warehouse with comprehensive protein + gene identifier and name mappings and annotations extracted from over 100 molecular databases. Value-added annotations are provided for key proteins from experimental findings using controlled vocabulary. The availability of pathogen and host omics data in an integrated framework allows global analysis of the data and comparisons across different experiments and organisms, as illustrated in several case studies presented here. (1) The identification of a hypothetical protein with differential gene and protein expressions in two host systems (mouse macrophage and human HeLa cells) infected by different bacterial (Bacillus anthracis and Salmonella typhimurium) and viral (orthopox) pathogens suggesting that this protein can be prioritized for additional analysis and functional characterization. (2) The analysis of a vaccinia-human protein interaction network supplemented with protein accumulation levels led to the identification of human Keratin, type II cytoskeletal 4 protein as a potential therapeutic target. (3) Comparison of complete genomes from pathogenic variants coupled with experimental information on complete proteomes allowed the identification and prioritization of ten potential diagnostic targets from Bacillus anthracis. The integrative analysis across data sets from multiple centers can reveal potential functional significance and hidden relationships between pathogen and host proteins, thereby providing a systems approach to basic understanding of pathogenicity and target identification

    Impact of nonoptimal intakes of saturated, polyunsaturated, and trans fat on global burdens of coronary heart disease

    Get PDF
    Background: Saturated fat (SFA), ω‐6 (n‐6) polyunsaturated fat (PUFA), and trans fat (TFA) influence risk of coronary heart disease (CHD), but attributable CHD mortalities by country, age, sex, and time are unclear. Methods and Results: National intakes of SFA, n‐6 PUFA, and TFA were estimated using a Bayesian hierarchical model based on country‐specific dietary surveys; food availability data; and, for TFA, industry reports on fats/oils and packaged foods. Etiologic effects of dietary fats on CHD mortality were derived from meta‐analyses of prospective cohorts and CHD mortality rates from the 2010 Global Burden of Diseases study. Absolute and proportional attributable CHD mortality were computed using a comparative risk assessment framework. In 2010, nonoptimal intakes of n‐6 PUFA, SFA, and TFA were estimated to result in 711 800 (95% uncertainty interval [UI] 680 700–745 000), 250 900 (95% UI 236 900–265 800), and 537 200 (95% UI 517 600–557 000) CHD deaths per year worldwide, accounting for 10.3% (95% UI 9.9%–10.6%), 3.6%, (95% UI 3.5%–3.6%) and 7.7% (95% UI 7.6%–7.9%) of global CHD mortality. Tropical oil–consuming countries were estimated to have the highest proportional n‐6 PUFA– and SFA‐attributable CHD mortality, whereas Egypt, Pakistan, and Canada were estimated to have the highest proportional TFA‐attributable CHD mortality. From 1990 to 2010 globally, the estimated proportional CHD mortality decreased by 9% for insufficient n‐6 PUFA and by 21% for higher SFA, whereas it increased by 4% for higher TFA, with the latter driven by increases in low‐ and middle‐income countries. Conclusions: Nonoptimal intakes of n‐6 PUFA, TFA, and SFA each contribute to significant estimated CHD mortality, with important heterogeneity across countries that informs nation‐specific clinical, public health, and policy priorities.peer-reviewe

    Generation and characterization of standardized forms of trehalose dihydrate and their associated solid-state behavior

    Get PDF
    Trehalose dihydrate is a nonreducing disaccharide which has generated great interest in the food and pharmaceutical industries. However, it is well recognized that considerable batch to batch variation exists for supposedly identical samples, particularly in terms of the thermal response. In this investigation, two standardized forms of trehalose dihydrate were generated using two distinct crystallization pathways. The two batches were characterized using scanning electron microscopy, X-ray powder diffraction, and FTIR. The thermal responses of the two forms were then studied using modulated temperature differential scanning calorimetry (MTDSC) and thermogravimetric analysis (TGA). In particular, we describe the technique of quasi-isothermal MTDSC as a means of studying the change in equilibrium heat capacity as a function of temperature. Finally, variable temperature FTIR was utilized to assess the change in bonding configuration as a function of temperature. SEM revealed significant differences in the continuity and grain structure of the two batches. The TGA, MTDSC, and quasi-isothermal MTDSC studies all indicated significant differences in the thermal response and water loss profile. This was confirmed using variable temperature FTIR which indicated differences in bond reconfiguration as a function of temperature. We ascribe these differences to variations in the route by which water may leave the structure, possibly associated with grain size. The study has therefore demonstrated that chemically identical dihydrate forms may show significant differences in thermal response. We believe that this may assist in interpreting and hence controlling interbatch variation for this material

    A novel formulation of inhaled sodium cromoglicate (PA101) in idiopathic pulmonary fibrosis and chronic cough: a randomised, double-blind, proof-of-concept, phase 2 trial

    Get PDF
    Background Cough can be a debilitating symptom of idiopathic pulmonary fibrosis (IPF) and is difficult to treat. PA101 is a novel formulation of sodium cromoglicate delivered via a high-efficiency eFlow nebuliser that achieves significantly higher drug deposition in the lung compared with the existing formulations. We aimed to test the efficacy and safety of inhaled PA101 in patients with IPF and chronic cough and, to explore the antitussive mechanism of PA101, patients with chronic idiopathic cough (CIC) were also studied. Methods This pilot, proof-of-concept study consisted of a randomised, double-blind, placebo-controlled trial in patients with IPF and chronic cough and a parallel study of similar design in patients with CIC. Participants with IPF and chronic cough recruited from seven centres in the UK and the Netherlands were randomly assigned (1:1, using a computer-generated randomisation schedule) by site staff to receive PA101 (40 mg) or matching placebo three times a day via oral inhalation for 2 weeks, followed by a 2 week washout, and then crossed over to the other arm. Study participants, investigators, study staff, and the sponsor were masked to group assignment until all participants had completed the study. The primary efficacy endpoint was change from baseline in objective daytime cough frequency (from 24 h acoustic recording, Leicester Cough Monitor). The primary efficacy analysis included all participants who received at least one dose of study drug and had at least one post-baseline efficacy measurement. Safety analysis included all those who took at least one dose of study drug. In the second cohort, participants with CIC were randomly assigned in a study across four centres with similar design and endpoints. The study was registered with ClinicalTrials.gov (NCT02412020) and the EU Clinical Trials Register (EudraCT Number 2014-004025-40) and both cohorts are closed to new participants. Findings Between Feb 13, 2015, and Feb 2, 2016, 24 participants with IPF were randomly assigned to treatment groups. 28 participants with CIC were enrolled during the same period and 27 received study treatment. In patients with IPF, PA101 reduced daytime cough frequency by 31·1% at day 14 compared with placebo; daytime cough frequency decreased from a mean 55 (SD 55) coughs per h at baseline to 39 (29) coughs per h at day 14 following treatment with PA101, versus 51 (37) coughs per h at baseline to 52 (40) cough per h following placebo treatment (ratio of least-squares [LS] means 0·67, 95% CI 0·48–0·94, p=0·0241). By contrast, no treatment benefit for PA101 was observed in the CIC cohort; mean reduction of daytime cough frequency at day 14 for PA101 adjusted for placebo was 6·2% (ratio of LS means 1·27, 0·78–2·06, p=0·31). PA101 was well tolerated in both cohorts. The incidence of adverse events was similar between PA101 and placebo treatments, most adverse events were mild in severity, and no severe adverse events or serious adverse events were reported. Interpretation This study suggests that the mechanism of cough in IPF might be disease specific. Inhaled PA101 could be a treatment option for chronic cough in patients with IPF and warrants further investigation

    Repositioning of the global epicentre of non-optimal cholesterol

    Get PDF
    High blood cholesterol is typically considered a feature of wealthy western countries(1,2). However, dietary and behavioural determinants of blood cholesterol are changing rapidly throughout the world(3) and countries are using lipid-lowering medications at varying rates. These changes can have distinct effects on the levels of high-density lipoprotein (HDL) cholesterol and non-HDL cholesterol, which have different effects on human health(4,5). However, the trends of HDL and non-HDL cholesterol levels over time have not been previously reported in a global analysis. Here we pooled 1,127 population-based studies that measured blood lipids in 102.6 million individuals aged 18 years and older to estimate trends from 1980 to 2018 in mean total, non-HDL and HDL cholesterol levels for 200 countries. Globally, there was little change in total or non-HDL cholesterol from 1980 to 2018. This was a net effect of increases in low- and middle-income countries, especially in east and southeast Asia, and decreases in high-income western countries, especially those in northwestern Europe, and in central and eastern Europe. As a result, countries with the highest level of non-HDL cholesterol-which is a marker of cardiovascular riskchanged from those in western Europe such as Belgium, Finland, Greenland, Iceland, Norway, Sweden, Switzerland and Malta in 1980 to those in Asia and the Pacific, such as Tokelau, Malaysia, The Philippines and Thailand. In 2017, high non-HDL cholesterol was responsible for an estimated 3.9 million (95% credible interval 3.7 million-4.2 million) worldwide deaths, half of which occurred in east, southeast and south Asia. The global repositioning of lipid-related risk, with non-optimal cholesterol shifting from a distinct feature of high-income countries in northwestern Europe, north America and Australasia to one that affects countries in east and southeast Asia and Oceania should motivate the use of population-based policies and personal interventions to improve nutrition and enhance access to treatment throughout the world.Peer reviewe

    Global, regional, and national under-5 mortality, adult mortality, age-specific mortality, and life expectancy, 1970–2016: a systematic analysis for the Global Burden of Disease Study 2016

    Get PDF
    BACKGROUND: Detailed assessments of mortality patterns, particularly age-specific mortality, represent a crucial input that enables health systems to target interventions to specific populations. Understanding how all-cause mortality has changed with respect to development status can identify exemplars for best practice. To accomplish this, the Global Burden of Diseases, Injuries, and Risk Factors Study 2016 (GBD 2016) estimated age-specific and sex-specific all-cause mortality between 1970 and 2016 for 195 countries and territories and at the subnational level for the five countries with a population greater than 200 million in 2016. METHODS: We have evaluated how well civil registration systems captured deaths using a set of demographic methods called death distribution methods for adults and from consideration of survey and census data for children younger than 5 years. We generated an overall assessment of completeness of registration of deaths by dividing registered deaths in each location-year by our estimate of all-age deaths generated from our overall estimation process. For 163 locations, including subnational units in countries with a population greater than 200 million with complete vital registration (VR) systems, our estimates were largely driven by the observed data, with corrections for small fluctuations in numbers and estimation for recent years where there were lags in data reporting (lags were variable by location, generally between 1 year and 6 years). For other locations, we took advantage of different data sources available to measure under-5 mortality rates (U5MR) using complete birth histories, summary birth histories, and incomplete VR with adjustments; we measured adult mortality rate (the probability of death in individuals aged 15-60 years) using adjusted incomplete VR, sibling histories, and household death recall. We used the U5MR and adult mortality rate, together with crude death rate due to HIV in the GBD model life table system, to estimate age-specific and sex-specific death rates for each location-year. Using various international databases, we identified fatal discontinuities, which we defined as increases in the death rate of more than one death per million, resulting from conflict and terrorism, natural disasters, major transport or technological accidents, and a subset of epidemic infectious diseases; these were added to estimates in the relevant years. In 47 countries with an identified peak adult prevalence for HIV/AIDS of more than 0·5% and where VR systems were less than 65% complete, we informed our estimates of age-sex-specific mortality using the Estimation and Projection Package (EPP)-Spectrum model fitted to national HIV/AIDS prevalence surveys and antenatal clinic serosurveillance systems. We estimated stillbirths, early neonatal, late neonatal, and childhood mortality using both survey and VR data in spatiotemporal Gaussian process regression models. We estimated abridged life tables for all location-years using age-specific death rates. We grouped locations into development quintiles based on the Socio-demographic Index (SDI) and analysed mortality trends by quintile. Using spline regression, we estimated the expected mortality rate for each age-sex group as a function of SDI. We identified countries with higher life expectancy than expected by comparing observed life expectancy to anticipated life expectancy on the basis of development status alone. FINDINGS: Completeness in the registration of deaths increased from 28% in 1970 to a peak of 45% in 2013; completeness was lower after 2013 because of lags in reporting. Total deaths in children younger than 5 years decreased from 1970 to 2016, and slower decreases occurred at ages 5-24 years. By contrast, numbers of adult deaths increased in each 5-year age bracket above the age of 25 years. The distribution of annualised rates of change in age-specific mortality rate differed over the period 2000 to 2016 compared with earlier decades: increasing annualised rates of change were less frequent, although rising annualised rates of change still occurred in some locations, particularly for adolescent and younger adult age groups. Rates of stillbirths and under-5 mortality both decreased globally from 1970. Evidence for global convergence of death rates was mixed; although the absolute difference between age-standardised death rates narrowed between countries at the lowest and highest levels of SDI, the ratio of these death rates-a measure of relative inequality-increased slightly. There was a strong shift between 1970 and 2016 toward higher life expectancy, most noticeably at higher levels of SDI. Among countries with populations greater than 1 million in 2016, life expectancy at birth was highest for women in Japan, at 86·9 years (95% UI 86·7-87·2), and for men in Singapore, at 81·3 years (78·8-83·7) in 2016. Male life expectancy was generally lower than female life expectancy between 1970 and 2016, an
    corecore